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AMENDMENTS TO THE CLAIMS 

The listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1. (Original) A text mining method for extracting features of documents using a 
term-document matrix consisting of vectors corresponding to index terms representing the 
contents of the documents, wherein contributions of the index terms act on respective elements 
of the term-document matrix, said method comprising: 

a basis vector calculating step of calculating a basis vector spanning a feature 
space, in which mutually associated documents and terms are located in proximity with 
each other, based on a steepest descent method minimizing a cost; 

a feature extracting step of calculating a parameter for normalizing the features 
using the term-document matrix and the basis vector and extracting the features on the 
basis of the parameter; and 

a term-document matrix updating step of updating the term-document matrix to a 
difference between the term-document matrix, to which the basis vector is not applied, 
and the term-document matrix, to which the basis vector is applied. 

2. (Original) A text mining method for extracting features of documents as claimed 
in claim 1, wherein the cost is defined as a second-order cost of the difference between the term- 
document matrix, to which the basis vector is not applied, and the term-document matrix, to 
which the basis vector is applied. 



Page 2 of 12 



Application No. 09/871,272 
Amendment "A" dated September 2, 2004 
Reply to Office Action mailed July 2, 2004 

3. (Original) A text mining method for extracting features of documents as claimed 
in claim 2, wherein said basis vector calculating step comprises: 

an initializing step of initializing a value of the basis vector; 

a basis vector updating step of updating the value of the basis vector; 

a variation degree calculating step of calculating a variation degree of the value 
of the basis vector; 

a judging step of making a judgment whether a repetition process is to be 
terminated or not using the variation degree of the basis vector; and 

a counting step of counting the number of times of said repetition process. 

4. (Original) A text mining method for extracting features of documents as claimed 
in claim 3, wherein said basis vector updating step updates the basis vector using a current value 
of the basis vector, the term-document matrix and an updating ratio controlling the updating 
degree of the basis vector. 

5. (Original) A text mining method for extracting features of documents as claimed 
in claim 4, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 

6. (Original) A text mining method for extracting features of documents as claimed 
in claim 3, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 
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7. (Original) A text mining method for extracting features of documents as claimed 
in claim 2, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 

8. (Original) A text mining method for extracting features of documents as claimed 
in claim 1, wherein said basis vector calculating step comprises: 

an initializing step of initializing a value of the basis vector; 

a basis vector updating step of updating the value of the basis vector; 

a variation degree calculating step of calculating a variation degree of the value 
of the basis vector; 

a judging step of making a judgment whether a repetition process is to be 
terminated or not using the variation degree of the basis vector; and 

a counting step of counting the number of times of said repetition process. 

9. (Original) A text mining method for extracting features of documents as claimed 
in claim 8, wherein said basis vector updating step updates the basis vector using a current value 
of the basis vector, the term-document matrix and an updating ratio controlling the updating 
degree of the basis vector. 

10. (Original) A text mining method for extracting features of documents as claimed 
in claim 9, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 
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1 1 . (Original) A text mining method for extracting features of documents as claimed 
in claim 8, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 

12. (Original) A text mining method for extracting features of documents as claimed 
in claim 1, wherein, when all basis vectors and normalizing parameters required in extracting the 
features have been already obtained, the calculation of normalizing parameters in said basis 
vector calculating step and the execution of said feature extracting step are omitted, and said 
feature extracting step extracts the features using the basis vectors and the normalizing 
parameters that have been already obtained. 
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13. (Original) A text mining apparatus for extracting features of documents using a 
term-document matrix consisting of vectors corresponding to index terms representing the 
contents of the document, wherein contributions of the index terms act on respective elements of 
the term-document matrix, said apparatus comprising: 

basis vector calculating means for calculating a basis vector spanning a feature 
space, in which mutually associated documents and terms are located in proximity with 
each other, based on a steepest descent method minimizing a cost; 

feature extracting means for calculating a parameter for normalizing the features 
using the term-document matrix and the basis vector and extracting the features on the 
basis of the parameter; and 

term-document matrix updating means for updating the term-document matrix to 
a difference between the term-document matrix, to which the basis vector is not applied, 
and the term-document matrix, to which the basis vector is applied. 

14. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 13, wherein the cost is defined as a second-order cost of the difference between 
the term-document matrix, to which the basis vector is not applied, and the term-document 
matrix, to which the basis vector is applied. 

15. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 14, wherein said basis vector calculating means comprises: 

initializing means for initializing a value of the basis vector; 

basis vector updating means for updating the value of the basis vector; 

variation degree calculating means for calculating a variation degree of the value 
of the basis vector; 

judging means for making a judgment whether a repetition process is to be 
terminated or not using the variation degree of the basis vector; and 

counting means for counting the number of times of said repetition process. 



Page 6 of 12 



Application No. 09/871,272 
Amendment "A" dated September 2, 2004 
Reply to Office Action mailed July 2, 2004 

16. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 15, wherein said basis vector updating means updates the basis vector using a 
current value of the basis vector, the term-document matrix and an updating ratio controlling the 
updating degree of the basis vector. 

17. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 16, wherein, when all the basis vectors and normalizing parameters required in 
extracting the feature have been already obtained, the calculation of normalizing parameters by 
said basis vector calculating means and the execution of said feature extracting means are 
omitted, and said feature extracting means extracts the features using the basis vectors and the 
normalizing parameters that have been already obtained. 

18. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 15, wherein, when all basis vectors and normalizing parameters required in 
extracting the features have been already obtained, the calculation of normalizing parameters by 
said basis vector calculating means and the execution of said feature extracting means are 
omitted, and said feature extracting means extracts the features using the basis vectors and the 
normalizing parameters that have been already obtained. 

19. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 14, wherein, when all basis vectors and normalizing parameters required in 
extracting the features have been already obtained, the calculation of normalizing parameters by 
said basis vector calculating means and the execution of said feature extracting means are 
omitted, and said feature extracting means extracts the features using the basis vectors and the 
normalizing parameters that have been already obtained. 
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20. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 13, wherein said basis vector calculating means comprises: 

initializing means for initializing a value of the basis vector; 

basis vector updating means for updating the value of the basis vector; 

variation degree calculating means for calculating a variation degree of the value 
of the basis vector; 

judging means for making a judgment whether a repetition process is to be 
terminated or not using the variation degree of the basis vector; and 

counting means for counting the number of times of said repetition process. 

21. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 20, wherein said basis vector updating means updates the basis vector using a 
current value of the basis vector, the term-document matrix and an updating ratio controlling the 
updating degree of the basis vector. 

22. (Original) A text mining apparatus for extracting features of documents in text 
mining as claimed in claim 21, wherein, when all basis vectors and normalizing parameters 
required in extracting the feature have been already obtained, the calculation of normalizing 
parameters by said basis vector calculating means and the execution of said feature extracting 
means are omitted, and said feature extracting means extracts the features using the basis vectors 
and the normalizing parameters that have been already obtained. 

23. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 20, wherein, when all basis vectors and normalizing parameters required in 
extracting the features have been already obtained, the calculation of normalizing parameters by 
said basis vector calculating means and the execution of said feature extracting means are 
omitted, and said feature extracting means extracts the features using the basis vectors and the 
normalizing parameters that have been already obtained. 
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24. (Original) A text mining apparatus for extracting features of documents as 
claimed in claim 13, wherein, when all basis vectors and normalizing parameters required in 
extracting the features have been already obtained, the calculation of normalizing parameters by 
said basis vector calculating means and the execution of said feature extracting means are 
omitted, and said feature extracting means extracts the features using the basis vectors and the 
normalizing parameters that have been already obtained. 
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25. (Currently Amended) A computer program product comprising a computer 
readable medium, on which a computer program is stored, the computer program causing a 
computer to execute the text mining method as claimed in claim 1 . for being e x e cut e d in a text 
mining apparatus for e xtracting f e atur e s of docum e nts using a t e rm document matrix consisting 
of v e ctors corr e sponding to ind e x t e rms r e pr e s e nting th e cont e nts of th e docum e nts, wh e r e in 
contributions of th e ind e x t e rms act on r e sp e ctiv e e l e m e nts of the term docum e nt matrix, th e 
comput e r program product comprising: 

basis v e ctor calculating st e p of calculating a basis v e ctor spanning a f e ature 

spac e , in which mutually associat e d docum e nts and terms ar e locat e d in proximity with 
e ach oth e r, bas e d on a st ee p e st d e scent m e thod minimizing a cost; 

f e atur e e xtracting st e p of calculating a paramet e r for normalizing th e f e atur e s 

using th e t e rm docum e nt matrix and th e basis v e ctor and e xtracting th e f e atur e s on th e 
basis of th e param e t e r; and 

t e rm docum e nt matrix updating st e p of updating th e t e rm docum e nt matrix to a 

diff e r e nc e b e tw ee n th e t e rm docum e nt matrix, to which th e basis v e ctor is not applied, 
and th e t e rm docum e nt matrix, to which the basis v e ctor is appli e d. 
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